System to assess genome sequencing needs for viral protein diagnostics and therapeutics.

Authors

  • Shea N Gardner
  • Thomas A Kuczmarski
  • Carol E Zhou
  • Marisa W Lam
  • Tom R Slezak
Abstract

Computational analyses of genome sequences may elucidate protein signatures unique to a target pathogen. We constructed a Protein Signature Pipeline to guide the selection of short peptide sequences to serve as targets for detection and therapeutics. In silico identification of good target peptides that are conserved among strains and unique compared to other species generates a list of peptides. These peptides may be developed in the laboratory as targets of antibody, peptide, and ligand binding for detection assays and therapeutics or as targets for vaccine development. In this paper, we assess how the amount of sequence data affects our ability to identify conserved, unique protein signature candidates. To determine the amount of sequence data required to select good protein signature candidates, we have built a computationally intensive system called the Sequencing Analysis Pipeline (SAP). The SAP performs thousands of Monte Carlo simulations, each calling the Protein Signature Pipeline, to assess how the amount of sequence data for a target organism affects the ability to predict peptide signature candidates. Viral species differ substantially in the number of genomes required to predict protein signature targets. Patterns do not appear based on genome structure. There are more protein than DNA signatures due to greater intraspecific conservation at the protein than at the nucleotide level. We conclude that it is necessary to use the SAP as a dynamic system to assess the need for continued sequencing for each species individually and to update predictions with each additional genome that is sequenced.
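The resampling idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' SAP or Protein Signature Pipeline: the toy sequences, the fixed peptide length `k`, the exact-match definitions of "conserved" and "unique", and every function name below are assumptions made for illustration only. Each Monte Carlo trial draws a random subset of target strains, counts the peptides shared by all sampled strains and absent from the non-target set, and the mean count is reported per sample size.

```python
import random

def peptides(seq, k):
    """Return the set of all length-k substrings (peptides) of a protein sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def signature_candidates(target_seqs, background_seqs, k):
    """Peptides present in every target sequence and in no background sequence."""
    conserved = set.intersection(*(peptides(s, k) for s in target_seqs))
    background = set().union(*(peptides(s, k) for s in background_seqs))
    return conserved - background

def simulate(target_seqs, background_seqs, k, sample_size, trials, rng):
    """Mean number of candidates over random subsets of `sample_size` target strains."""
    total = 0
    for _ in range(trials):
        sample = rng.sample(target_seqs, sample_size)
        total += len(signature_candidates(sample, background_seqs, k))
    return total / trials

# Hypothetical toy data: three "strains" of a target and one non-target sequence.
TARGET = ["MKTAYIAKQR", "MKTAYIAKQL", "MKTAYLAKQR"]
BACKGROUND = ["MKTGGGGGGG"]

if __name__ == "__main__":
    rng = random.Random(0)
    for n in range(1, len(TARGET) + 1):
        mean = simulate(TARGET, BACKGROUND, k=4, sample_size=n, trials=100, rng=rng)
        print(f"{n} genome(s): mean candidates = {mean:.2f}")
```

In this sketch the mean count cannot increase as more strains are sampled, because a peptide must be shared by every sampled strain. Watching where the curve levels off is one rough way to judge whether additional genomes would still change the predictions.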

Similar resources

A review of DNA sequencing techniques (first, second, and third generation)

Introduction: DNA sequencing is the most important technique in molecular biology for identifying the order of nucleotides in a piece of DNA. There are several different methods for sequencing DNA. DNA sequencing is now of great importance in medical diagnostics and other medical fields. Some methods have been invented to speed up and increase the efficiency of the...

Cloning of Rota Virus Outer Capsid Protein (VP7) Gene into the pGEM Vector

Background and Aims: In humans the group A rotaviruses are associated with endemic diarrhea in children under the age of 5, leading to approximately 800,000 deaths every year. Introduction of rotavirus vaccines into childhood immunization programs can contribute to substantial reduction in mortality from rotavirus gastroenteritis in developing countries and virtually eliminating hospitalization...

Assessment of Foot and Mouth Virus Subtype O2016 Genetic Alterations During Successive Passages in BHK Monolayer

Abstract: Foot-and-mouth disease (FMD) is an important contagious viral disease of livestock, caused by a virus of the genus Aphthovirus, which belongs to the RNA virus family Picornaviridae. An important characteristic of FMD virus is its high mutation rate, which gives rise to antigenic diversity in the surface neutralizing proteins. For this reason, FMD virus has 7 distinct serotypes and many subtypes. Vaccination is o...

Sequencing needs for viral diagnostics.

We built a system to guide decisions regarding the amount of genomic sequencing required to develop diagnostic DNA signatures, which are short sequences that are sufficient to uniquely identify a viral species. We used our existing DNA diagnostic signature prediction pipeline, which selects regions of a target species genome that are conserved among strains of the target (for reliability, to pr...

Transcriptome Sequencing of Guilan Native Cow in Comparison with bosTau4 Reference Genome

RNA-sequencing is a new method for characterizing the transcriptomes of organisms. Based on identity and relatedness, there are large genetic variations among different cattle breeds. The goal of the current study was to sequence the transcriptome of the Guilan native cow and compare it with the available reference genome using RNA-sequencing. Blood samples were collected from 14 Guilan native cows and...

Journal:
  • Journal of clinical microbiology

Volume 43, Issue 4

Pages: -

Publication date: 2005